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The paper aims to showcase work on improvement over SHA-512 with proposed work BLAKE-512 hash 
function. Hash functions form an important category of cryptography, which is widely used in a great number of protocols 
and security mechanisms. It is defined as computationally efficient function, which maps binary strings of arbitrary length 
to binary strings of fixed length. The last ones are the outputs of a hash computation and they are called hash values. 
Hash functions are applied to support digital signatures, data integrity, random number generators, authentication schemes, 
and data integrity mechanisms. National Institute of Standard and technology (NIST) focus on the new SHA-3 
competition, started by the NIST, which searches for a new hash function in response to authentication concerns regarding 
the previous hash functions SHA-1 and the SHA-2 family. The paper aims to show that since SHA-512 used 80 rounds to 
calculate 512 bit final hash value with 512 bit initial hash value whereas BLAKE-512 is used with 16 rounds to calculate 
final hash value with same input length and same output length in C programming language. 
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INTRODUCTION 

Hash functions form an important category of cryptography, which is widely used in a great number of protocols 
and security mechanisms. Hash functions are mostly used to accelerate table lookup or data comparison tasks such as 
finding items in a database, detecting duplicate or similar records in a large file, finding similar stretches in DNA 
sequences, and so on. 

Hash Function BLAKE Consists of Four Steps 

Step 1: Padding the Message: For BLAKE-256 the message is extended so that its length is congruent to 447 
modulo 512. Length extension is performed by appending a bit 1 followed by a sufficient number of 0 bits. 
For BLAKE-512, message padding goes as follows: append a bit 1 and as many 0 bits until the message bit length is 
congruent to 895 modulo 1024. In the BLAKE-224 padded data, the 1 bit preceding the message length is replaced by a 0 
bit. In the BLAKE-384 padded data, the preceding the message length is replaced by 0 bit. 
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Step 2: Parsing the Padded Message: In BLAKE-256, padded message is split into 16-word blocks m", . . ., 
We let 1' be the number of message bits in m", . . .., m', that is, excluding the bits added by the padding, 
hi BLAKE-224, the output is truncated to its first 224 bits. In BLAKE-384, the output is truncated to its first 384 bits. In 
BLAKE-256, the output is truncated to its first 256 bits. 

Setting the initial hash value: BLAKE-512 uses the initial values of SHA-512. BLAKE-224 uses the eight initial 
values of SHA-224. BLAKE-384 uses the initial value of SHA-384. BLAKE-256 uses the eight initial values of SHA-256. 

Step 3: Message Digest: In BLAKE-256 the iterated hash returns ho^ , h/'. In BLAKE-224 iterated hash 

returns ho^ , hg^ In BLAKE-384 iterated hash returns ho^ , h^^. 
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Figure 1 

BLAKE-512 operates on 64-bit words and returns a 64-byte hash value. All lengths of variables are doubled 
compared to BLAKE-256: chain values are 512-bit, message blocks are 1024-bit, state is 256-bit, counter is 128-bit. 

Step 4: Compression Function: The compression function of BLAKE-512 is similar to that of BLAKE-256 
except that it makes 16 rounds instead of 14. 

• It uses the initial values of SHA-512. 

• BLAKE-32 and BLAKE-64 uses all outputs (256-bit & 5 12-bit each one) bits. 

• Hashing the message 

• For BLAKE-512, message padding goes as follows: append a bit 1 and as many 0 bits until the message bit length 
is congruent to 895 modulo 1024. Then append a bit 1, and a 128-bit unsigned big-endian representation of the 
message bit length:- 



m^ mil 1000 0001||<l>i 
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PROPOSED WORK 

BLAKE-512: According to previous work of SHA-512, it uses the 80 rounds to generate the final hash value 
(h) with 1024 bit block size, 512 bit digest, 64 bit word length, 128 bit message length and 80 rounds but BLAKE-512 
uses 16 rounds to generate final hash value (h ) with same block size, digest, word length and message length. The initial 
hash value H*"* is the following sequence of 64-bit words (which are obtained by taking the fractional parts of the square 
roots of the first eight primes): 

Initial Values 

IVo= 6A09E667 IV i= BB67AE85 
IV2= 3C6EF372 IV3= A54FF53A 
IV4= 510E527F IV5= 9B05688C 
IV6= 1F83D9AB IV7=5BE0CD19 



Table 1: Properties of SHA Hash Functions 
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Table 2: Properties of BLAKE Hash Functions (Sizes in Bits) 
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Table 3: Rounds of SHA-512 and BLAKE-512 
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Table 3 shows the 64 rounds is used in SHA-256 and 14 rounds is used in BLAKE-256, other properties are same 
for both candidates. 

BLAKE-512 

We have proposed three functions for calculation of final hash value. These three functions are following: 
Initialization 

This function uses the initial values of SHA-512. 
Initial Values 

The hash function BLAKE-512 operates on 64-bit words and returns a 32-byte hash value. This part defines 
BLAKE-512, going from its constant parameters to its compression function, then to its iteration mode. 

BLAKE-512 starts hashing from the same initial value as SHA-512: 

IVo= 6A09E667 IV i= BB67AE85 

IV2= 3C6EF372 IV3= A54FF53A 

IV4= 510E527F IV5= 9B05688C 

IV6= 1F83D9AB IV7= 5BE0CD19 

A 16-word state Vq. . . Visis initialized such that different inputs produce different initial states. The state is 
represented as a 4x4 matrix shows in figure 6.1. 
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Figure 2: 4 x 4 Matrix 

In the initiaUzation stage a large inner state is initialized from the previous chain value (h), salt (s), and counter (t). 
For the implementation of Vg,..., V15 a XOR chain is used, between the values of salt (So,...,S3) and the constant c (co,...,C7). 
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Figure 3: Blake-64 Initialization Unit Architecture 

Round Function 

The state value is then updated by message-dependent rounds. 



Once the state v is initialized, the compression function iterates a series of 16 rounds. It is finally compressed to 
create the next chain value. In every round, the state v is transformed based on addition, XOR and right rotation 
computations, which are the components of Gi (i=0,...,7) functions. For this purpose the Gj functions are used. When the 
round sequence is taken over, the new chain value h' = h'o, h'y is produced from the state v, with inputs of the initial 
chain value ho, ■■■,h^ and the salt s = Sq, . . ., S3. Table 4 shows the permutation table which is used for solving m^r and Ccr in 
below equations (1-8): 

Table 4: Permutation Table 



crc 


0 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


O"! 


14 


10 


4 


8 


9 


15 


13 


6 


1 


12 


0 


2 


1 1 


7 


5 


3 


IJ2 


1 1 


8 


12 


□ 


5 


2 


15 


13 


10 


14 


3 


6 


7 


1 


9 


4 


Oi 


7 


9 


3 


1 


13 


12 


1 1 


14 


2 


6 


5 


10 


4 


0 


15 


a 


04 


9 


0 


5 


7 


2 


4 


10 


15 


14 


1 


1 1 


12 


6 


8 


3 


13 




2 


12 


6 


10 


0 


1 1 


8 


3 


4 


13 


7 


5 


15 


14 




9 




12 


5 




15 


14 


13 


4 


10 


0 


7 


6 


3 


9 


2 


8 


1 1 


ay 


13 


11 


7 


14 


12 


1 


3 


9 


5 


0 


15 


4 


8 


6 


2 


10 


o"a 


6 


15 


14 


9 


1 1 


3 


0 


8 


12 


2 


13 


7 


1 


4 


10 


5 


0-9 


10 


2 


8 


4 


7 


6 


1 


5 


15 


1 1 


0 


14 


3 


12 


13 


0 



Each round of transformation is made of eight operations using the Gi functions defined as follows or a round is a 
transformation of the state v that computes as. 

Go(Vo, V4, Vg, V12) Gl(Vi, V5, V9, V13) G2(V2, Vg, Vio, V14) G3(V3, V7, Vii, V15) 
G4(Vo, V5, Vio, V15) G5(Vi, Vg, Vii, V12) G6(V2, V7, Vg, V13) G7(V3, V4, V9, V14) 

where, at round r, Gi(a, b, c, d) sets: 

a^a + h + (m„X2i) XORc,,, (2iH-l)) 1 

d^ (dXORa)»> 16 2 

c^ (c H- d) 3 
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b^(b XORc) »>12 

a^a + b + (mJ2i+l) XORc^^ (2i)) 

(d XORa) »> 8 

(c + d) 
b^(b XORc) »>7 



4 
5 
6 
7 



From the above equations (1-8) is obvious that for each one of the variables a,..., d two district values are set 
during one round transformation [1]. For each one of Gi functions an alternative implementation is proposed. All of them 
could be achieved based on the same generic Gj architecture, which is basically consists of n-bit modulo adders, 
XOR chains and, shift operations and registers. In order to implement the Gi functions in below figure 6.3 
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Figure 4: Gifunction Structure 







V2 








V4 




















Vl4 


Vis 















Figure 5: Gi Functions Transformation: (a) Column Step (b) Diagonal Step 



In figure 6.4 shows the first four calls Go,..., G3 can be computed in parallel. We call the procedure of computing 
Go,. . ., G3 a column step. The last four calls G4,. . ., G7 update distinct diagonals thus can be parallelized as well, which we 
call a diagonal step. At round r > 9, the permutation used is Omiod 10. 

Finalization 

After the rounds sequence, the new chain value h'o, . . ., h'yis extracted from the state Vq, . . ., Viswith input of the 
initial chain value ho, . . ., hyand the salt So, . . ., Ssas follows [3]: 
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h 0^ hoXORso XOR VqXOR Vg 
h'l^ hiXORsi XORviXORvg 
h'2^ hzXORsz XORV2XORV10 

hjXORsj XORV3XORV11 
h h4XORso XORV4XORV12 

hjXORsi XORV5XORV13 
h'g^ hgXORsz XORvgXORvM 
h'7^ hvXORs, XORV7XORV15 



chain value 




next 
chiain value 



s;3lt counter message chain value salt 
Figure 6: The Local Wide-Pipe Construction of Blake's Compression Function 



It has been estimated that five rounds is the minimum number that must be executed for BLAKE -28 & - 32 bit, 
although ten rounds are recommended. For BLAKE-48, -64 fourteen rounds are recommended, although the functions 
could operate sufficiently with seven rounds of data transformations. 

EVALUATION 

We have used simulation tool (Borland Turbo C) to show simulation work: It gives us C code computing the 
compression function of BLAKE-512 with Header File BLAKEh.h and Turbo file Turbo. c. Firstly, It calculates Go 
(vo, V4, Vg, V12) 
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Figure 7: Hash Values Obtained after Two Block Message 
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Figure 8: Hash Value Obtained after One-Block Message 
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Figure 9: Changed Hash Value Obtained after Two-Block Message 
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Figure 10: Changed Hash Value Obtained after One-Block Message 
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Figure 11: Changed Combined Hash Value of One & Two Block-Message 

CONCLUSIONS 

A cryptographic hash function is a hash function, that is, an algorithm that takes an arbitrary block of data and 
returns a fixed-size bit string, the (cryptographic) hash value, such that an (accidental or intentional) change to the data will 
(with very high probability) change the hash value. In this work the comparison of two SHA-3 cryptography hash function 
JH and BLAKE is done, which is less time consuming process. BLAKE-256 with 14 rounds instead of 64 rounds of 
SHA-256 is more reliable in comparison to SHA family. This work gives the comparative hash algorithms of both 
candidates. 

FUTURE WORK 

In future the working of the function (BLAKE-512) discussed here should be more less time consuming. It can 
merge all three function of BLAKE-512 into single unit then give one time input (512 bit) and get 512 bit final hash value 
using xilinx. 
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